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Abstract 

B-series originated from the work of John Butcher in the 1960s as a tool to analyze numer- 
ical integration of differential equations, in particular Runge-Kutta methods. Connections 
to renormalization have been established in recent years. The algebraic structure of classical 
~C Runge-Kutta methods is described by the Connes-Kreimer Hopf algebra. 

Lie-Butcher theory is a generalization of B-series aimed at studying Lie-group integrators 
for differential equations evolving on manifolds. Lie-group integrators are based on general Lie 
i " i group actions on a manifold, and classical Runge-Kutta integrators appear in this setting as 

the special case of 1" acting upon itself by translations. Lie-Butcher theory combines classical 
£N| B-series on E n with Lie-series on manifolds. The underlying Hopf algebra Hm combines the 

^ Connes-Kreimer Hopf algebra with the shuffle Hopf algebra of free Lie algebras. 

C Aimed at a general mathematical audience, we give an introduction to Hopf algebraic 

structures and their relationship to structures appearing in numerical analysis. In particular 
we explore the close connection between Lie series, time-dependent Lie series and Lie-Butcher 
series for diffeomorphisms on manifolds. The role of the Euler and Dynkin idempotents in 
lf~) numerical analysis is discussed. A non-commutative version of a Faa di Bruno bialgebra is 

introduced, and the relation to non-commutative Bell polynomials is explored. 
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1 Outline of the paper 

The main point of this paper is to explore algebraic structures underlying groups of formal diffeo- 
morphisms on manifolds. The paper focuses on some important mathematical structures appear- 
ing in numerical integration on manifolds that arc likely to find applications also in other areas of 
mathematics. The relationship between classical Lie scries on manifolds, time-dependent Lie scries 
and Lie-Butcher scries is explained in detail. Furthermore, we develop the algebraic structures 
introduced in [331.38, 39, 4TJ [44j [43l[3] and in particular explore connections between Hopf- and Lie 
algebras, differential geometry and analysis of numerical integration on manifolds. Due to space 
limitation, the paper does not go into a detailed study of applications of these algebraic structures 
in numerical analysis, but we sketch briefly several of the many applications in this field. 

The introductory Chapters 2 and 3 give an overview of well-known results. Chapter 2 contains 
a brief introduction to numerical integration and algebraic structures appearing in numerical 
analysis, both classical methods on R n and Lie group methods generalizing to manifolds. Chapter 
3 presents a brief introduction to Hopf algebraic structures. 

Chapter 4 contains more new and recent material. A main goal is to detail the algebraic struc- 
tures of Lie Butcher theory, and discuss the interplay between algebraic and differential geometric 
viewpoints. In particular we want to emphasize the strong connections between the algebraic 
theory of Lie series, time-dependent Lie scries and Lie Butcher scries. Chapter 4 therefore starts 
with a discussion of classical Lie series and pullback formulas on manifolds, continuing with an 
exploration of some less known time-dependent pullback formulas. We will explain the relevance 
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of the Euler and Dynkin idempotents in numerics and introduce a non-commutative Dynkin-Faa 
di Bruno bialgebra, related to non-commutative Bell polynomials appearing in various contexts in 
earlier works: numerical analysis |38j , control theory |36j and quantization |31j . This Dynkin-Faa 
di Bruno bialgebra is related to, but different from, the Hopf algebras explored by Brouder et. al. 
in [B]. 

In the final part of Chapter 4, we discuss Lie-Butcher series. In particular we explore backward 
error analysis and the substitution law in the setting of algebras of non-commuting frames on 
manifolds. Although space does not allow for a detailed exposition and application of these 
subjects, we hope that this presentation will systematize the theory and open these topics for 
further future research. 

2 Introduction to numerical integrators and their analysis 

Let be a manifold and F : M. — > TAi a vector field. By the flow of an autonomous vector field 
F we mean the diffeomorphism *&t,F '■ •A'l — * defined for t <G M such that *&s,F°*&t,F — *&s-\-t,Fi 
$o,F = Id and d/dt\ t=Q $ 4 ,f(p) = F(p) for all p e M. 

Numerical integration of ODEs is about constructing good numerical approximations to <&t,F 
for a given vector field F. A numerical integration algorithm yields a diffeomorphism ^h,Fi 
henceforth called the numerical integrator. The real parameter h is called the stepsize. For an 
initial point yo € M, and a chosen stepsize h > 0, the numerical method produces a discrete 
sequence of solution points j/j = ^h,F{yi-i) with the goal of arriving at ~ <&kh,F(yo)- Note 
that, unlike the exact flow, numerical integrators are not 1-parameter Lie groups in h. In general 
we have ^/, : fO$ sf ^h+ Sl F, and ^-h,F F ■ Integrators for which the latter identity 

holds are called (time-) symmetric methods. Most integrators satisfy the consistency conditions 
^fo,F = Id and d/dt\ t _ ^t,F{p) = F(p) as well as scaling homogeneity ^h,F — ^imf- 

Many algebraic aspects of numerical integration is related to the computation of compositions, 
logarithms and exponentials of numerical integrators. In recent years surprising connections be- 
tween numerical analysis and renormalization in perturbative quantum field theory have been 
revealed. In this introduction, we will introduce some basic algebraic structures arising in the 
analysis of numerical integrators. In particular we will focus on structures that originate from 
the study of Lie group integrators, which are numerical integrators on general manifolds. The 
resulting theory combines Lie theory with the classical Butcher theory that describes numerical 
integrators on M. n . 

This first section presents a survey of well known results from numerical analysis. A detailed 
understanding of this introductory section is not necessary for reading the rest of the paper, and 
readers mainly interested in algebraic structures may jump directly to Section [3] 

2.1 Classical integrators 

In the early 1960s John Butcher set out on a journey to explore the algebraic territory of numerical 
algorithms for integrating ODEs evolving on vector spaces 

y'(t)=F(y), yeW 1 , F:R n ^R n . (2.1) 

In particular he studied the family of Runge-Kutta methods. Given a timestep h £ K, these 
methods advance the solution from j/o = 3/(0) to y\ ~ y{h) as: 

for r = 1 : s do 

Y r = J2k=l a rk.Fk + l/o 

F r = hF(Y r ) 

end 

Vi = Efe=l b k F k + 2/o ■ 



2 



This basic step is iterated: j/o 1— * Vi >- * • • • 1— * 2/n> with constant or variable stepsizes ft., until 
the final solution y„ ss y(^n) is reached. The coefficients a r fc and for r, /c £ {1, . . . , s} define a 
particular s-stage RK method. 

A goal of numerical analysis is to characterize coefficients a r k and bk which yield 'good' methods 
(when applied to a given class of differential equations). The view of what a good integration 
method is has, however, evolved over the last decades. Traditionally, order theory and stability 
were the most important properties to consider. A numerical integrator is of order p if the first 
p+ 1 terms of the Taylor expansion of the analytical solution agrees with the first p+ 1 terms of the 
numerical method (developed in the parameter h). Requiring a certain order results in algebraic 
conditions, called order conditions, on the coefficients of the method. 



Many numerical methods for solving the equation (2.1l can be studied by using B-series (see 
e.g. |25j, introduced by Hairer and Wanner in 1974 [26 . A B-series is a (formal) series indexed 
over the set of rooted trees T and can for a vector field F be written as 

B h (a)(y) = a(l)y + ]T ^-a(r)T F (r)(y). (2.2) 

Here a is a map a : T — > R, I is the empty tree, |r| is the number of vertices of r (the order of 
the tree r) and a is a certain symmetry factor. The map J-p(r) : K™ — > K" is the elementary 
differential of the tree r and is given recursively as follows: 

T F {r) = FW(F F (n)(y),...,F F (T m )(y))(y), (2.3) 

where r = B + (t\, r m ) is the tree constructed by adding a common root to the subtrees Ti . . . r m , 
and F^" 1 ' is the m'th derivative of the vector field. 

One way in which B-series can be applied to the study of numerical methods is to order theory. 
For example, the order conditions for Runge-Kutta methods can easily be obtained by writing 
the method as a B-series and then comparing the coefficients of this series with the exact solution 
written as a B-series (see e.g. Chapter. III. 1.2]). 

The composition of Runge-Kutta methods is also of great interest, and this leads to the study 
of the composition of B-series. A series Bh F (a) is inserted into another series Bh F (b), and this 
gives the B-series B hF (a)(B hF (y)(b)) = B hF (a ■ b)(y). The resulting product a ■ b gives rise to a 
group, called the Butcher group PUB], 

Butcher realized early that the set of Runge-Kutta methods forms a group, and characterized 
algebraically the composition and inverse in this group. Much later, this group was identified with 
the character group of the Connes-Kreimer Hopf algebra [17j HH [5] . 

In recent years the importance of preserving various geometric properties of the underlying con- 
tinuous dynamical system has become better understood. The research topic Geometric Numerical 
Integration [25] emphasizes this view. Geometric integration algorithms have been successfully de- 
veloped for various classes of differential equations, such as volume preserving flows, Hamiltonian 
equations, systems with first integrals and equations evolving on manifolds. An important tool 
for investigating the geometrical properties of a numerical integrator is through its backward error 
analysis. For a given numerical method ^h,Fi we seek a series expansion of a modified vector field 
(h, F) i— ► Fh such that the numerical solution equal^the analytical flow of the modified vector 
field, IVf = $ i# 

This is computed as a formal logarithm F^ = Log^h.p), in Hopf algebraic language expressed 



by the Eulerian idempotent (Section 3.2.21 



Still another idea, which has been developed in [14] . is to ask for a series development of 
a modified vector field Fh, such that when the numerical method is applied to Fh, the exact 
analytical solution is produced: ^ hFh = &h,F- This has been taken much further in recent 

lr The series for is a formal series which usually does not converge. By truncating the series at an optimal 
point we find a modified equation which is exponentially close to the numerical solution, see |25| . In this paper we 
deal only with formal series, and convergence is not considered. 
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work |15[ll0j. The algebraic operation (h,F) i— > Fh is commonly referred to as a substitution law. 
The Hopf algebra of the substitution law is introduced in [TU] . 

B-series is often a very important component in a numerical analyst's toolbox, and is used 
to study order theory, backward error analysis, modified vector fields and structure preserving 
properties of numerical integrators. 

2.2 Lie group integrators 

Numerical Lie group integrators for ODEs is a generalization of numerical integration of ODEs 
from the classical setting of equations on R" to differential equations on manifolds. In this section, 
we will give a short introduction to this topic. See [28 for an extensive survey. 

2.2.1 Exponential Euler method 

Let M denote a manifold, XM its Lie algebra of vector fields with the Jacobi bracket and Diff (M) 
the group of diffcomorphisms on M. Let exp : XM — > Diff(.M) denote the flow operator. We 
want to numerically integrate an ODE on M, given as 

y'(t)=F(y), y(0) = yo for F G XM, (2.4) 

having the analytical solution y(t) — exp(tF)-yo. Here exp(/F)-?/o denotes the evaluation of the 
diffcomorphism exp(/F) at yo G M. 

Assumption 2.1. The fundamental assumption for numerical Lie group integrators is the exis- 
tence of a subalgebra g C XM. such that 

• All vector fields V G g can be exponentiated exactly. 

• The Lie algebra g defines a frame on TM, i.e. g spans the tangentspace T p M at all points 
p G M. Ln other words, the action generated by g is transitive on M . 

The vector fields in g are called the frozen vector fields. Due to the frame assumption, we can 
always express the vector field F and the ODE \2A\ in terms of frozen vector fields via a function 
/ : M — > g as F(y) = f(y)-y, where f(y) G XM and f(y)-y denotes evaluation of this vector field 



in y. Thus (2.4 1 can be written in the form 

y , (t) = f(y)-y, y(o) = y eM. (2.5) 

In the case where g forms a basis for T y M, the function f{y) is uniquely defined. In more general 
situations, g is an overdetermined frame for T y M, and there is a freedom in the choice of /. This is 
called a choice of isotropy, and is of major importance for the quality of the numerical integrator. 



With the equation written as (2.5), we can present the simplest of all Lie group integrators, 
the exponential Euler method. Given a timestep h G K, the method advances the solution from 
Vo = 2/(0) to 2/1 « y(h) as: 

Algorithm 2.2 (Exponential Euler). 

2/i = exp(hf(y Q ))-y . 

In each step the method advances the solution yy. i— > 2/fc+i by integrating the frozen vector field 
equation 

y'(t) = f(y k )-y, y(0) = y k 

from t = to Z = h. We will in the sequel present methods of higher order and with superior 
qualities compared to this simple scheme. The main theme of the paper is the algebraic structures 
arising from the numerical analysis of such integration schemes. 
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2.2.2 Choosing a good action 

In practice it is of importance that exp: g — ► DiS(M) can be computed fast, and furthermore 
that the given vector field F(y) is locally well approximated by f(yo)-y. Exactly what 'well' 
means, depends on what we want to achieve. In many situations we can choose g so that certain 
first integrals of the original system are exactly preserved by the frozen flows. Choosing g and 
/ : M. — > g is in many ways similar to choosing a preconditioner in iterative methods for solving 
linear equations; we want a good approximation which is easy to compute. 

A simple choice of g is obtained by embedding M. C R" and choosing g = K™ as the (commuta- 
tive) algebra generated by {d/dxj} 7 f =1 , i.e. the constant vector fields on ~R n . Since the vector fields 
are constant, we have f(yo)-y — f(yo), so the function / simply becomes f(yo) = F(yo) £ M. n , 
all commutators in g vanish and the exponential on g is exp(V)-p — V + p for V,p £ W 1 . In this 
case all Lie group integrators will reduce to classical integrators, e.g. exponential Euler becomes 
the classical Euler yi — hF(yo) + yo. 

The other extreme is g = XM. and f(y) = F for all y, in which case exponential Euler yields 
the analytical solution exactly. However, in this case the computation of the exponential on g is 
just as difficult as solving the original equation. We seek efficient choices in between these two 
extremes. 

In many cases g is given as the infinitesimal generators of a (e.g. left) Lie group action on 
M. For example, consider the sphere M = S 2 acted upon from left by the group G — SO(3) 
of orthogonal 3x3 matrices, with the Lie algebra so(3) consisting of skew 3x3 matrices. Any 
matrix V £ so(3) is uniquely identified with the infinitesimal generator]^] £y £ XM, given by 
matrix-vector multiplication £yj/ = Vy for y 6 S 2 . Therefore ( |2.5[ ) becomes y'{t) — V(y)y, 
where V(y) is a skew symmetric matrix. The exponentiation is related to the matrix exponential 
cxpm(y) as exp(£y)-y = expm(V r )y. 

Another important example of group actions arise in the solution of isospectral differential 
equations, where GL(n) acts on gl(n) by the adjoint action (similarity transform) A-Y — AYA^ 1 
for A £ GL(rt), Y £ gl(n). In these problems M C gl(n) is one of the (isospectral) orbits 



of the action. For this action (2.5) acquires the isospectral form Y'(t) = [B(Y),Y] for some 
B(Y) £ gl(n). Since the action is a similarity transform it is guaranteed that all the eigenvalues 
of Y(t) are preserved also by the numerical integrator. 

Yet another example, which occurs in Lie-Poisson problems in computational mechanics, is 
the coadjoint action of a Lie group on the dual of its Lie algebra, g*. In this case M C g* is 
a coadjoint orbit. Using this action we can guarantee that the numerical Lie group integrator 
exactly preserves the Casimirs of the continuous system. 

For other problems it may be advantageous to choose g by simplifying the original equation to 
a family of integrable equations. A simple example is the computation of the motion of charged 
particles in a magnetic field. The solution in the case of constant magnetic fields is given by helical 
motions around the field lines. The corresponding Lie algebra yields fast and accurate Lie group 
integrators for the full problem of non-constant magnetic fields. Another example is integration 
of a spinning top, where we obtain simpler equations by considering the direction of gravity as 
being constant in body coordinates. In both these problems, the action preserves important first 
integrals of the system. A third example is integration of stiff equations on W 1 , where an integrable 
Lie algebra is obtained by considering all affine linear vector fields. This connects the theory of 
Lie group integrators with the so-called exponential integrators. See |28j for details. 

2.2.3 Higher order methods 



Most Lie group methods for integrating (2.5 1 are built from linear operations and commutators 



in g and compution of flows of frozen vector fields (exponentials). Runge-Kutta type methods 



2 Recall that the identification of the Lie algebra of a left group action with the infinitesimal generator in XM 
is an anti-homomorphism, [£v,£vk] = —£,[v,w]- in this paper the brackets are Jacobi brackets on XM, and some 
signs may differ compared with cited papers. 
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with basic motions expressed in terms of an exponential of a sum of elements in g are commonly 
referred to as RKMK methods [35], as in the following example: 

Algorithm 2.3 (4'th order RKMK from [38]). 

Yi = y F t = hf{Y 1 ) 

Y 2 = exp(iF 1 )- 2 / F 2 = hf(Y 2 ) 

Y 3 = exp(iF 2 + ±[F u F 2 ])-y F 3 = hf(Y 3 ) 

y 4 = exp(F 3 + i [Fx, F 3 ] )-y F 4 = hf(Y 4 



4 



4 



V=\F 1+ §(F 2 + F 3 ) + ±F 4 I = ^ + i(F 2 + F 3 ) - &F< 
yx = exp(V + [I,V])-y . 

Methods where the basic motions are products of exponentials of simple elements in g are 
called Crouch-Grossman methods 1181 1441. 



Algorithm 2.4 (3'rd order Crouch-Grossman method from [44]). 

Yx = y F 1 = hf{Y x ) 

r 2 = exp(|F 1 ). 2/0 F 2 = hf(Y 2 ) 

Y 3 = exp(ii|F 2 ).exp( 1 ^F 1 ). 2 / F 3 = hf(Y 3 ) 
y x = exp(ifF 3 )-exp(-|F 2 )-exp(f|F 3 ) 2 /o- 

More recently methods have been developed which combine exponentials of sums and products 
of exponentials, as in the commutator free Lie group methods |13j . An example is: 

Algorithm 2.5 (4'th order commutator free method from [13 ). 

Yx = y Fx = hf(y ) 

y 2 = exp(iF 1 )-y F 2 = hf(Y 2 ) 

y 3 = exp(iF 2 )- 2/0 F 3 = hf(Y 3 ) 

Y 4 = expC-iFx + F 3 )-Y 2 F 4 = hf(Y 4 ) 

yx = eMjFx + \{F 2 + F 3 ) - ^f 4 ).exp(-^Ji + \{F 2 + F 3 ) + \F 4 )-y . 

For equations of Lie type y'{t) = f(t) ■ y, numerical methods based on Magnus and Fer ex- 
pansions have been developed in [551 HZ] > an d the algebraic theory has recently been developed 
further in }21j . 

To study order conditions, backward error analysis and structure preservation of such methods, 
it is important to understand B-series in a general setting of group actions on manifolds. A first 
attempt at combining Lie and B-series in a common mathematical framework appeared in [37U38] . 
Hopf-algebraic aspects have been explored further in [39, 3, 41.. 



3 Hopf algebras 

This section is meant as a short collection of facts and properties of Hopf algebras that we will 
use in this work. For a more thorough introduction, see e.g. pQ, [34], [47], [30], [TTj . 

3.1 Basic definitions 

Let A: be a field containing Q. 

Definition 3.1. A k-algebra A consists of a fc-vector space A together with two maps /i : A® A — * 
A and r\ : k —> A, called the product and the unit of A, such that: 

(i) [i is associative, i.e. /io(/®^i) = ^, o (/x (g) I) , 

(ii) the composites A = A g) k ^> A ® A A A and A = A ® k A <g> A A A both equal /. 



G 



Here 7 denotes the identity map. 

An algebra A is called commutative if fj, o r = [i, where r is the flip map t(<zi (8 02) =02® oli- 

Definition 3.2. A k-coalgebra C is a fc-vector space equipped with two maps A : C — > C ® C 
and e : C — > fc, the coproduct and the counit, such that: 

(i) A is coassociative, i.e. (A ® 7) o A = (7 ® A) o A, 

(ii) the composites C^C®C^C®k^C and C ^ C ®C ^ k®C both equal 1. 

A coalgebra C is called cocommutative if r o A = A. 

Definition 3.3. A bialgebra 77 over /c is a /c-vector space equipped with both an algebra (77, //, 77) 
and a coalgebra structure (77, A, e), such that the coproduct A : 77 — > 77 ® 77 and the counit 
e : 77 — > fc are algebra morphisms. These compatibility conditions can be expressed in terms of 
the following commutative diagrams, where r denotes the flip operation r(/ii, /12) = (^2, 



77 <g> 77 <g> 77 ® 77 — ► 77 <g 77 ® 77 ® 77 

77®77 k®k 



77®77 ► 77 ►77(g)77 p 



t 

77 ► fc 



A bialgebra 77 is called commutative if it is commutative as an algebra, and cocommutative if 
it is cocommutative as a coalgebra. 

Remark 3.4. There is symmetry in the definition of a bialgebra. Rather than requiring the 
coalgebra structure to respect the algebra structure in the above sense, we could have switched 
the role of the two structures. To complete the symmetry, we could in addition reverse the arrows 
in the two diagrams above. This would result in an equivalent definition. 

Grading. Let 77 be a graded fc-vector space, i.e. 77 = ©„> 77„. There is a notion of a graded 
bialgebra, obtained by requiring the following of the algebra and coalgebra structure, respectively: 

(i) n{H p ,H q ) C H p+q 

(ii) A(77„)c0 p+9= „77 p ®77 9 . 

The grading of an algebra 77 gives rise to the grading operator Y : 77 — > 77 given by 

Y : h 1 — ► khk, 

k>0 

where h = X) n >o ^« ^ ©«>o ^n- ^ g r &ded bialgebra 77 = © n>0 H n is called connected if 77 = k. 

Proposition 3.5 ( |34j ) . Let 77 be a connected, graded bialgebra. Then, for any x € 77„, n > 0, 
we have: 

Ax — 1 <g) x + x ® 1 + Ax, where Ax £ (J) H p ®H q . 

p+q=n, p,q>0 

We will often use the Sweedler notation for the coproduct: 

Ai = ^ ® ji(2) an d Ax = %' (8> x". 

(x) (x) 



3 All diagrams were created using Paul Taylor's diagram package, available from 
http : //www.paultaylor . eu/diagrams/ 
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Definition 3.6. A Hopf algebra is a bialgebra (H, [i, rj, A, e) together with an antihomomorphism 
S on H, called the antipode, with the property given by the commutativity of the following 
diagram: 



S®1 

H(g)H - H ®H 




A Hopf algebra is graded if it is graded as a bialgebra and the antipode satisifies S(H n ) C H n . 
If a bialgebra is graded and connected, then it is automatically a graded Hopf algebra: 

Proposition 3.7 (|34j). Any connected graded bialgebra is a Hopf algebra. The antipode S is 
given recursively by S{1) = 1 and 

S(x)=-x-^2S(x')x" 

(x) 

for x € ker e. 

Example 3.8. The Shuffle Hopf algebra. Recurring in the sequel are Hopf algebras built 
from letters in an alphabet. We follow the notation of Reutenauer [46 . Consider a finite or 
infinite alphabet of letters A = {a, b,c,.. .}. We write A* for the collection of all empty or non- 
empty words over A, where I is the empty word. Let k(A) be the fc-algebra of non-commutative 
polynomials in A. A polynomial P G k(A) will be written as a sum 

u>£A* 

where (P,ui) € k is non-zero only for a finite number of u. Let P, Q S k(A). The product of P 
and Q, written as PQ, has coefficients 

(PQ,u)= (P,u)(Q,v). 

ui=uv 

The k- linear dual space denoted k((A}} := Homfc(fc(„4), k) is identified with all infinite /c-linear 
combinations of words. An a £ k((A)) can be written as an infinite series 

where (a, ui) = a(oj) € k and (•, •) is the dual pairing defined such that words in A* are orthogonal, 
(WX,(J2) = ^i,u;2 for a11 wi,w 2 eA*. 

We define two different associative products on k(A). The concatenation product uji,u>2 i— * 
LO1LO2 obtained by concatination of words and the shuffle product wi,W2 ^ wi U L02 obtained by 
linearly combining all possible shuffles of the two words i.e. combinations where the letters within 
each word are not internally permuted, e.g. 

abc U de = abcde + abdce + adbce + dabce + abdec + adbec + dabec + adebc + daebc + deabc. 



The shuffle product can be defined recursively as 

((iwi) U (bu>2) — a(u>i U bu>2) + h(au>\ U 012), 
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where a, b G A and u>\, u>2 € .4* . The unit of both concatenation and shuffle is the empty word I. 

By dualization of these products we obtain the deconcatenation and the deshuffle coproducts. 
The deconcatination coproduct Ad : k(A) — > k(A)<S)k(A), defined for uj — a\a 2 ■ ■ ■ a k S A* as: 

fc 

A d (u) =^ai- ■ ■ a,i®a i+1 - ■ ■ a k . (3.1) 
i=l 

This coproduct is the dual of the concatenation product, so for any P,Q G k(A) 
(PQ,w) = (P®Q,A d (w)) = ( P ^(i))(Q^(2))- 

The deshuffle product A u : k(A) — > fc(.A)<£>fc(.4) is similarly defined such that 
(PL\Q,w) = (P®Q, A u (o;)) = J! (J> (1) )(Q,u; (2) ). 

The two coproducts can also be characterized by requiring that the letters in the alphabet A 
are primitive, i.e. that A (a) — l£g>a+a(g>l for a € A, and then extending A to be a homomorphism 
with respect to either of the two products on k(A). We refer to [IB] for explicit presentations of 
the deshuffle coproduct. 

We remark that the vector space k(A) can now be turned into Hopf algebras in two different 
ways. The cocommutative concatenation Hopf algebra is obtained by taking the concatenation as 
product and the deshuffle as coproduct. The commutative shuffle Hopf algebra Tlg^A) is obtained 
by taking the shuffle as product and the deconcatenation as coproduct. Both these Hopf algebras 
share the same antipode: 

S(a 1 a 2 ■ . . a k ) = (-l) k a k a k -i ...a\. (3.2) 

In both cases the unit and counit is given by ?y(l) = I and e(I) = 1, e(uj) = for all ui £ A*\l. We 
write 7igh when A is understood. 

The vector space k(A) can be identified with the vector space underlying the tensor algebra 
T(V) on the vector space V generated by the alphabet A. The two algebra structures (concati- 
nation and shuffling) correspond to the usual algebra structures given to the tensor algebra T(V) 
and the tensor coalgebra T C (V), respectively. 

3.2 Characters and endomorphisms 

This section is based on [5U]. See also [53], [7] and [35] . 

Let (H, /i, r), A, e) be a graded bialgebra, and (A, 7]a) an algebra. The set Hom k (H, A) of linear 
maps from H to A sending ?7h(1) =: Ih to — : has an algebra structure given by the 

convolution product: 

a * (3 = ha ° (« ® P) ° A. 

The convolutional unit is the composition of the counit of H and the unit of A: 5 :— t)a ° e - The 
convolution can be written using the Sweedler notation: 

a* (3 = 2^a(z (1 )) ■ 0{%{2)), 

(x) 

from which we find a*S = 5*a = a. The unital algebra morphisms from H to A consists of all 
a € Homfc(ff, A) such that a(ljj) = 1a and a(n(h,h')) — a(h)-a(h') for all h,h! € H. 

Proposition 3.9 (|34j). Let H be a graded Hopf algebra and A a commutative algebra. The set 
Horn Aig(H, A) of unital algebra morphisms from H to A, equipped with the convolution product, 
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forms a group, G(H,A), called the group of A- valued characters of H. The inverse of an element 
a is given by 

a*^ 1 — a o S, 

where S is the antipode of H. 

In the special case A = fc, we get the group of characters of H, written as G(H) := G(H, fc). 
The grading on H splits the group of A- valued characters into graded components: 

G(H,A) £* Y[ttom Alg (H n ,A). 

n>0 

This is not a graded vector space, but rather the completion of one (see e.g. [10]), but we will still 
call it a graded vector space. The restriction of a character a : H — > A to the degree n component 
H n of H will be denoted by a n . 

3.2.1 Infinitesimal characters, the exponential and the logarithm 

The infinitesimal A-valued characters, written q(H, A) are the linear maps a from H to A such 
that: 

a(n(h, h')) = a(h) ■ 8{h!) + 5(h) ■ a{h'), 

where 5 — t]a ° e. This is a Lie algebra under the bracket induced by the convolution product: 
[a, f3) — a * (3 — (3 * a. In the special case where A — k we write g(H) := g(H, fc). 

The characters and the infinitesimal characters are connected via the exponential and the 
logarithmic map. Let a G Homfe(_ff, A). The exponential and logarithm with respect to convolution 
are given by the formal series: 

exp» - 

^— ' n! 

n>0 

log*(<5 + a) = Y [ -^ a* n . 

n>l 

If iJ is graded and connected, and if a(l) = 0, where 1 e fc = i?o, then a* k = a * ■ ■ ■ * a = on 
ff n for n < fc, and therefore both these sums are finite when restricted to H n , provided a(l) = 0. 
The maps exp* and log* give a bijection between G(H,A) and g(iJ, A). 

Example 3.10. Let Hsh denote the shuffle algebra over A. Consider the dual space k((A)) 
equipped with the convolution product 

(a*/3,u>)= ^ (a ! w (1 ))(/3,W( 2 )) = ^ (a,wi)(/3,cj 2 ). 

Note that convolution is just concatenation of series a * (3 = a/3. The characters and infinitesimal 
characters g(Hsh): G(Hsh) C k((A}} are given as 

flOftsh) = {a 6 fc((^)) | a(I) =0 and a(wiUw 2 ) =0 for all u u u 2 € A*\l} 
G(Tish) = {ae k((A)) | a (I) = 1 and a(o;i U w 2 ) = a(wi)a(w 2 ) for all wi,w 2 € >t*\I}. 

The convolutional unit <5 is given as (6, 1) = 1 and (<5, w) — for all a; e ^4\I. The logarithm of 

a G G(Ttsh) can be computed as log(a) = X) n >o — n — ( a — ^)* n - For any uj G A* we find that 
(log(a), w) is given by a finite sum expressed in terms of the Eulerian idempotent. 
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3.2.2 Eulerian idempotent 

Let H be a commutative, connected and graded Hopf algebra. Consider Endfe(i/) = Homfc(iJ, H) 
equipped with the convolution product *. Let Id G Endfc(_ff) be the identity endomorphism and 
6 = r\ o e G Endfe(iJ) the unit of convolution. 

Definition 3.11 ([32 ). The Eulerian idempotent e G End(_ff) is given by the formal power series 

e := log*(Id) = J- ^ + ^ + ... (-l)W^! + ■ ■ • , 
2 3 z 

where J = Id — (5. 

Proposition 3.12 -fbr any commutative graded Hopf algebra H, the element e G Endfc(iJ) 

defined above is an idempotent: e o e — e. 

The practical importance of the Eulerian idempotent in numerical analysis arises in backward 
error analysis, where the following lemma provides a computational formula for the logarithm: 

Proposition 3.13. For a G G(H) and h G H , we have 

log*(a)(h)=a(e(h)). 

In other words, the logarithm can be written as right composition with the eulerian idempotent: 

log* = _oe:G(F)^ fl (F). 
The result follows from the following computation, which uses that a is a homomorphism: 
((a -d)* l ,w)= /i/c°(a<g) • • • (g)a)oA'w = aofi l H oA l u = aoJ* l u, 
where (— )' denotes Z-fold application. 

3.2.3 The graded Dynkin operator 

There is another bijection between the infinitesimal characters and the characters in any commu- 
tative graded Hopf algebra H, described in [20] . The bijection is given in terms of the Dynkin 
operator D : H —> H. 

Classically, the Dynkin operator is a map D : k(A) — > Lie(A), where Lie(A) = g(7ish) H k{A) 
are the Lie polynomials. The classical Dynkin operator is given by left-to-right bracketing: 

D(a 1 ...a n ) = [... [[ai,a 2 ],a 3 ], . . . ,a n ], where [a^dj] = aiOj - a^. 

Letting Y(ui) = f^(uj)oj denote grading operator, where is word length, it is known that the 

Dynkin idempotent, given as Y~ 1 D, is an idempotent projection on the subspace of Lie polynomi- 
als. As in [50], the Dynkin operator can be written as the convolution of the antipode S and the 
grading operator D — S *Y . This description can be generalized to any graded, connected and 
commutative Hopf algebra H: 

Definition 3.14. Let H be a graded, commutative and connected Hopf algebra with grading 
operator Y : H — > H. The Dynkin operator is the map D : H — > H given as 

D := S * Y. 

Lemma 3.15 ( 20J). The Dynkin operator is a H-valued infinitesimal character of H. 

Theorem 3.16 ([20 ). Right composition with the Dynkin operator induces a bijection between 
G(H) and q{H): 

_oD:G(H)^ Q (H). 
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The inverse is given by T : g(H) — ► G(H) as 



r («) = E 



E 



afei * • • • * 0L kl 



k\ + ■ ■ ■ + ki = n, 
kx,...,ki > 



k 1 (k 1 + k 2 )---(ki + --- + 



(3.3) 



where = ot\ Hk . 

Later we will apply the Dynkin operator and its inverse in the setting of a shuffle algebra 
7ish(OT), where OT is an alphabet of all ordered rooted trees, and the grading |t| of r € OT 
counts the nodes in the tree. 



4 Algebras of formal diffeomorphisms on manifolds 

The main goal of this section is to arrive at Lie Butcher series and the underlying Hopf algebra Hat. 
This Hopf algebra contains the Connes-Kreimer Hopf algebra as a subalgebra and is also closely 
related to 7ish- To emphasize the natural connection between Lie-Butcher series, H.N and more 
classical Lie series, we start with a discussions of Lie series (autonomous and non-autonomous). 



4.1 Autonomous Lie series 

In this section we review the well-known theory of Lie series on manifolds and the corresponding 
Hopf algebraic structures of the free Lie algebra. The algebraic theory is detailed in QT] and 
for the analytical theory we refer to [2J. 

Let J 7, be a vector field on a manifold M. and $t,F '■ -M. — ► M. its flow. Let tp : Ai — > E be a 
section of a vector bundle over A4, and let F ip denote the pullback. For the applications later 
in this paper we will only consider trivial bundles, in which case we write tp : M — * V for some 
vector space V and define pullback as composition F ip = i/jo<f> t F . The Lie derivative of ?A is 
defined as 

d 



F ^ = Ft 



Composition of Lie derivatives defines an associative, non-commutative product of vector fields 
F,G i— > FG, where vector fields are first order differential operators. The product FG is the 
second order differential operator (FG)[ip] = F[G[tp]] etc. We let I denote the O'th order identity 
operator l[ip] = tjj. The linear span of all differential operators of all orders forms the universal 
enveloping algebra U(XM.). 

The basic pullback formula is [2 

JU* F ^ = $* F (i^]). (4.2) 

Iterating this we find 9™/3t n | 4=0 $* F tp = F[F[- ■ ■ [ip]]] := F n [ip], and hence follows the (Taylor)- 
Lie form of a pullback series: 

°° fj 

3=0 3 ' 

Fundamental questions are: Which series in U(XM.) represent vector fields and which represent 
pullback series? How do we algebraically characterize compositions and the inverse of pullback 
series? How do we understand the Exp map taking vector fields to their pullback series, and 
the inverse Log operation? These questions are elegantly answered in terms of the shuffle Hopf 
algebra. We will detail these issues, and see that the same structures reappear in the discussion 
of B-series later. 



12 



An algebraic abstraction of Lie series starts with fixing a (finite or infinite) alphabet A and a 
map v: A — > XM, assigning each letter to a vector field. As in Example |3.8| we let WL(A) denote 
all finite M-linear combinations of words built from A and 7ish the shuffle algebra. The map v 
can be uniquely extended to a linear T v : R(A) — > U(XM) as a concatenation homomorphism: 

= I, 

J- U (a) — v(a) for all letters a £ A, 
T v {u\tjJ2) = T v {uj\)T u (ijJ-i) for all words uj\ , u>2 G A* ■ 

We extend T v to a map i3 t taking an infinite series a € M((^4)) to an infinite formal series 
B t (a) e f/(<Y M)*, defined for t e M as follows: Consider the alphabet A with a grading \a\ e N+ 
for all a & A. This extends to Hsh as \u>\ — \ai \ + . . . + |afe| for all w = ai . . . £ A*, |I| = 0, thus 
?ish becomes a graded connected Hopf algebra. Given the grading we define 

B t (a) = J2 tMa(u)r v (u>). (4.3) 



Consider Tlsh* = ^((^4)) with the convolution a * [3 = aft as in Example 3.10 By construction 
Bt is a convolution homomorphism, 

Bt(a*(3) = B t (a)B t ((3). 

For a real valued infinitesimal character a € fl(Hsh), an d a fixed t = h, Bh(a) is a formal vector 
field on M.. For a real valued character j3 £ G(Hsh), Bh(P) represents a formal diffeomorphism 
$/! on Al via the pullback series 

B h (p) ty>] = i)o^ h for if; : M -»• M. 

Note, however, that pullbacks compose contravariantly with respect to composition of diffeomor- 
phisms: 

B h {Pi * /3 2 )M = BhiPJBhifoM = ^o$ 2 o$!. 

Thus to summarize: Composition of diffcomorphisms is modelled by convolution in G(7ish) 
(in opposite order), the inverse of a diffeomorphism is computed by right composing with the 
antipode, the convolutional exponential maps to the exponential of Lie series and the logarithm 
is computed by composing with the Eulerian idempotent, 

B h ((3oS)B h ((3) = I for (3 E G(H Sh ) 

B h (exjp*(a)) = Exp(6 h (a)) for a e fl(W Sh ) 

B h (f3) = E X p(B h (/3oe)) for p e G(W Sh ). 

In the next section we discuss flows of non-autonomous equations, and we will see that right 
composition with the Dynkin idempotent represents algebraically the operation of finding a non- 
autonomous vector field corresponding to a diffeomorphism on a manifold. 

Remark 4.1. In [40 the Lie algebra of infinitesimal characters g(7ish) is studied as a graded free 
Lie algebra. An explicit formula for the dimension of the homogeneous components = 0(7ish)|fc 
is derived for general gradings. This is very useful for the study of the complexity of Lie group 
integrators. 



4.2 Time-dependent Lie series 

The classical Faa di Bruno Hopf algebra models the composition of formal diffcomorphisms on E 
( 23J, [25], [23]). In this section we will see that this has a natural generalization to compositions 
of time-dependent flows on manifolds. We introduce a Dynkin~Faa di Bruno bialgebra describing 
the composition of flows of time-dependent vector fields on a coarse level that considers only the 
grading of the terms in the ^-expansion of the time-dependent vector fields. 
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4.2.1 Non-commutative Bell polynomials and Dynkin Faa di Bruno bi-algebra 

Let T — {dj}°^ 1 be an infinite alphabet in 1-1 correspondence with N + , and consider the free 
associative algebra T> = R(T) with the grading given by \dj \ = j and \dj 1 ■ ■ ■ dj k \ = ji + ■ ■ ■ + jk- 
Let d: T> — > T> be the derivation given by d(di) = d,+x, linearity and the Leibniz rule 8(10x^2) = 
d(u>i)u}2 + u>xd(u} 2 ) for all 0Jxi^2 <E 1* . We let #(u>) denote the length of the word u>. 

Definition 4.2. The non-commutative Bell polynomials B n = B n (di, . . . , d n ) € R(X) are defined 
by the recursion 

Bo = I 

B n = (dx + d)B n _i = (d x + d) n l for n > 0. 

The first of these are given as 

B = I 

B x = dx 

B 2 = d\ + d 2 

B 3 = dl + 2dxd 2 + d 2 dx + d 3 

Bx = df + 3djd 2 + 2dxd 2 dx + d 2 dj + 3dxd 3 + d 3 dx + 3d 2 d 2 + d A . 

The polynomials B n are introduced in |37l I38j to explain the Butcher order theory of Runge- 
Kutta methods in a manifold context, and generalize to certain classes of numerical integrators 
on manifolds. 

Remark 4.3. Additional insight to the Bell polynomials are obtained by considering the free 
associative algebra generated by two symbols dx and d, defining 

di := [d, di-x] = ddi-x — <h-xd for i > 1. 

We find by induction that (dx + d) n satisfies the binomial relation 

(dx + d) n = (f) Mdi,- ■ ■ , d k )d n ~ k , (4.4) 



fe=0 



which yields the formula 



exp (dx + d) = £ B ™(di,---,d m ) ^ 



m=0 



and also the recursion 



B n+ x(dx,...,d n +x) =^ ( j-Bfe(dx,...,dfc)d„-fc +1 for n > 0. (4.6) 
k=o ^ ' 

The non-commutative partial Bell polynomials B n j, = B n k (dx, ■ ■ ■ , d„_fc+i) are defined as the 
part of B n consisting of the words u> of length #(w) — k > 0, e.g. -B4 3 = 3d^d 2 + 2d 1 d 2 d 1 + d 2 d\. 
Thus 

n 

B n = ^ £>n,fc- 
k=l 

A bit of combinatorics yields an explicit formula: 

B n , k = (4.7) 



|u|=n,#(w)=ft 
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where for uj = dj 1 dj 2 ■ ■ ■ dj k 



n\ n \ n 



v<*V \\djil\d j3 \,...,\d jh \J • • • j'fe! 

are the multinomial coefficients and the coefficients k(lo) are defined as 

KH S ' ' ^ l} := ii(ji + i a ).^x+i + --. + J fc )- (4 - 8) 

The coefficients k form a partition of unity on the symmetric group Sk , 

E = 

<T6S fc 

where tr(w) denotes a permutation of the letters in ui. E.g. k(1,2) + k(2, 1) = | + | = 1. 

It is often useful to employ polynomials Q n and Q n> k related to B n and by the following 
rescaling: 

Q n , k (di,...,d n -k+i) = ^B nt k(l\di,...,j\dj,...) = E k(w)ui (4.9) 

"' | w |=n,#(w)=fc 
n 

Q n (d!,...,d n ) = ^QnAdlf-'dn-k+l) (4-10) 

k=l 

Qo := I. (4.11) 

Note that i? n and JS^ ^ become the classical Bell- and partial Bell polynomials when the 
product in R(X) is commutative, i.e. in the free commutative algebra on X. A non-commutative 
Faa di Bruno Hopf algebra is studied in [6]. However, their definition differs from the present by 
defining the polynomials Q n k without the factor k that associates different factors to different 
permutations of a word (adding up to 1 over all permuatations). 

These Bell polynomials are closely related to the graded Dynkin operator on a connected graded 
Hopf algebra H. For a £ H* , define a graded algebra homomorphism di i— > di(a) : T> — > iJ* as 

<ii(a) = exj = , didj(a) = on * ay. (4-12) 

Proposition 4.4. TTie operator defined as 

oo 

0(a) = EQ«( a )' ( 4 - 13 ) 

71=0 

is a bijection from infinitesimal characters to characters Q : 8(H) — > G(H) with inverse given by 
right composition with the Dynkin idempotent Y~ 1 oD, 

Q-\I3) = 0aY- l oD, (4.14) 

where Y is the grading operator on H and D = S *Y is the graded Dynkin operator. 

Proof. For a 6 q{H) we have 

oo ... 

T(aoY) = Y. E • r T — = Q(a), (4.15) 



thus the result follows from Theorem I3T61 □ 
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The non-commutative Dynkin-Faa di Bruno bialgebra T> is obtained by taking the algebra 
structure of T> and defining the coproduct A-p as 



Ax, (I) = I®I 

n 

A B (4) = J2 B « 



(4.16) 
(4.17) 



k=l 



This extends to all of T> by the product rule Ax>(didj) = Ax>(di)A'p(dj). Thus, e.g. 

Ax,(di) = diOrfi 
A v (d 2 ) = d\®d 2 + d 2 ®d 1 
Ax(did 2 ) = df ®did 2 + did 2 ®df. 



Note that the coproduct is not graded by | • |, thus Proposition 3.7 does not hold for V. By a 
lengthy (but not enlightening) induction argument we can prove: 



Lemma 4.5. The coproduct of the partial Bell polynomials are given as 

n 

Ax>(-B„ jfc ) = ^2 B n ,t®B t 



(4.18) 



Note that B n i = d n , thus (4.171 is a special case of (4.181. Summing the partial B n ^ over k, 
we find the coproduct of the full Bell polynomials 

n 

Av{B n ) = J2 B n,k®B k . 

k=l 



Using Lemma 4.5 and the fact that B n ,k — for k > n, one can easily show that I? is a 
bialgebra. 

Proposition 4.6. T> = R(I) with the non- commutative concatenation product and the coproduct 
Ax form a bialgebra T> which is neither commutative nor cocommutative. 



4.2.2 Pullback along time-dependent flows 

Let F t = J^Lq ij'+ijf be a time-dependent vector field on M where Fj = F^'~ lj 
be the solution operator of the corresponding non- autonomous equation, such that 

y(t)=$ t , Ft y solves y' (t) = F t {y(t)), y(0) = y . 

(Note that &t,F t is n °t & 1-parameter subgroup of diffeomorphisms in t.) 



Let <f> t<Ft 



Lemma 4.7. \37j The n-th time derivative of the pullback of a (time-in 
along the time- dependent flow &t,F t is given as 



d" 



% F ^ = B n {F t )[i>], 



ent) function ip 



(4.19) 



where B n (F t ) is the image of B n under the homomorphism from T> to U{XM.) given by di i— ► 
F^ 1 ^ . In particular 

°" $^ = fl n (F 1 ,...,F n )[# (4.20) 
t=o 



dt n 
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Proof. The non-autonomous vector field F t on A4 corresponds to the autonomous field F t + d/dt 
on M x R, thus |42| yields 



dt 



Of 



Consider the homomorphism induced from d\ i— ► and 9 i— > d/dt, thus i— > i 7 ) 
tion (4.191 follows directly from Definition 4.2 At t = we have di i— > Fj, thus (4.201. 



(i-i) 



Equa- 

□ 



Remark 4.8. Note that (4.5 1 yields a space-time split formula for pullback which is valid also for 
pullback of a time-dependent function tp t . The pullback for t € [0, /i] developed at t = becomes 



(£1 \ ^ X " > 7 77, O 

/ {=0 n=0 



h n 



M=y;- r s n (F 1 ,...,i?;)[^]. 



t=0 n=0 



The Dynkin idempotent relates pullback series with their corresponding time-dependent vector 
fields. Let A be an arbitrary alphabet with a grading | • | : A — ► N + , let Hsh — Ttsh(A) be the 
corresponding graded shuffle algebra and let Bt(a) be defined in (4.3). 

Proposition 4.9. Let a S fl(7ish) and (3 = Q(a) S G(7ish) &e related by the graded Dynkin 
idempotent as in Proposition ^ -4\ Define the time- dependent vector field 

o 

F t = ^B t (a). 
dt 

Then pullback of a time-independent tp along the time- dependent flow $>t.F t is given as 

$* F > = (4.21) 

Proof. We have F t — J2°jL o Fj+iifi where Fj — T v {j\a.j). Developing the Taylor series of ^l t p t ip 
at t = we get from (|4.20[) 



n=0 



Thus 



-,B n {Fi, ...,F n )= F{—B n {l\ax, . . . ,n\a n )) = F(Q n (ai,.. . ,a n )). 



Using (4.151 we obtain the result. 



□ 



4.3 Lie— Butcher theory 

Pullback formulas such as ( |4.21[ ) relate the time derivatives of F t with the spatial derivatives of 
a function tp. We have captured the algebraic structure of the temporal derivations through the 
Dynkin idempotent Y~ l oD: G(Hsh) ~ * fl(^Sh) and its inverse ToY: fl(Wsh) — > G(Hsh)- However, 
the spatial Lie derivation Bt(/3)[tp] cannot be algebraically characterized within this structure. In 
order to do this, we need to refine the Hopf algebra Tish- On the manifold, we obtain a refined 
version of U(XAi) by expanding differential operators in terms of a non-commuting frame on 
XM. 

If the manifold is W 1 and the frame is the standard commutative coordinate frame, the con- 
struction yields the classical Butcher formulation and the Connes-Kreimer Hopf algebra |4J . More 
generally we obtain a Hopf algebra Ti m , built on forests of planar trees, which contains the Conncs 
Krcimcr algebra as a subalgebra. In 7ijv we can represent Lie derivation in terms of tree graftings. 
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4.3.1 Differential operators in U(XM) expanded in a non-commuting frame 

Let XM denote the Lie algebra of all vector fields on M and let g C XM be a transitive Lie 
subalgebra, in the sense that g everywhere spans TM. This means that g defines a frame on the 
tangent bundle. We do not assume that the frame forms a basis. In general dim(g) > dim(.A4), 
and in case of strict inequality we have a non-trivial isotropy subgroup at any point. 

Let U(g) denote the universal enveloping algebra of g. We let q m and U(q) m denote maps 
from M to g and from M to U(g). Since g is assumed to be transitive, we can represent any 



vector field F £ XM. with a function / £ g M as in Section |2.2.1| Similarly, any higher order 
differential operator in U(XM) can be represented as a function in U(q) m . We have the natural 
inclusion g C Q M and J7(g) C U{g) M as constant maps, called frozen vector fields and higher 
order differential operators. 

We identify U(q) m with sections of the trivial vector bundle M®U(g) — » M and for a dif- 
feomorphism <&: M — ► M we define pullback of f £ U(q) m as $*/ = /o$ e U(q) m . Pullback 
in this bundle defines a parallel transport which gives rise to a flat connection with torsion. For 
/, g £ U(g) M we define the connection f[g] £ U(q) m pointwise from the Lie derivative as 

f[g}(p) = (f(p)[g})(p), p£M. 

Similarly, the concatenation in C/(g) is extended pointwise to a concatenation product fg £ U(q) m 
as 

(fg)(p) = f(p)g(p), P£M. 

This is called the frozen composition of f and g. We can also compose / and g as non-frozen 
differential operators /•<? £ U(g) M : 

(f»9)[h] = f[g[h}], for all h e U(q) m . 

This is identical to the composition in U(XAi), which in Section [4~7] was written as F, G i— > FG 
for F,G £ XM. 

It might be illustrative to write out the operations explicitly in terms of a basis {9fe}^ =1 of 
(non-commuting) vector fields spanning g. Writing /, g £ q m in terms of the frame as / = fkdk 
and g = J2e ge^i for f k ,gi £ K M , we have 



/.9 = ^2 fkgidkdi 

k,e 

f[g] = ^2fkd k [g e ]d e 

k,e 

f*g = ^2 fkd k [ge]de + ^ fkged k d e 



k,i k.l 

The connection /[<?], the frozen composition fg and nonfrozen composition fug are related as: 
Lemma 4.10. Let f £ g M and g, h £ U(g) M . Then we have 

%] = g 

f[9h}=f[g]h + g(f[h]), (Leibniz) 

(f*g)[h] ■=f[g[h}] = (fg)[h] + (f[g})[h], 

where I £ U(g) M is the constant identity map. 

The proof is given in [4T| . Note the difference between fg and f»g. In the concatenation the 
value of g is frozen to g(p) before the differentiation with / is done, whereas in the latter case the 
spatial variation of g is seen by the differentiation using /. Interestingly, the work of Cay ley from 
1857 [12 starts with the same result for vector fields expanded in the commuting frame d/dxi. 
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From this lemma we may compute the torsion and curvature of the connection. Let f,g G g . 
We henceforth let [/,#]. := f*g — g»f denote the Jacobi bracket and [f,g] = fg — gf the frozen 
bracket. The frozen bracket is computed pointwise from the bracket in g as [/, g](p) = [f(j>), g(p)]g- 
Writing the connection as V/fif := f[g], we find 

T(f,g) = V f g-V g f-[f,g}. = gf-fg = -[f,g] 
R(f,g)h = VfV g h- V 9 V//i-V [/>5] ./i = 0. 

Note that if g is commutative, then [/, g] = and the connection is both flat and torsion free. 
In this case f[g) is a pre-Lie product generating the Jacobi bracket: f[g) — g[f) = [/, g],, but in 
general f[g] - g[f] = [f,g], - [f,g]. 

The product fug is associative, and thus U(q) m with the binary operations /, g i— * f[g] and 
f-,9 l— > /•<? forms a unital dipterous algebra [33 , however, it has more structure than this. Fol- 
lowing |41j we define: 

Definition 4.11. Let A — I@ A be a unital associative algebra with product f,g i— > /g, and also 
equipped with a non-associative composition /, g i— * f\g] : Ax A — ► A Let -D(-4) denote all / € .4 
such that /[•] is a derivation, 

D(4) = { / G A | /foft] - + <?(/[/*]) }• 



4. We call A a D-algebra if for any derivation / 6 2? (.A) and 



We assume that -D(-4) generates 
any j,/i£iwe have 



<?[/] G D(A) 

m = 9 

f[g[h]] = {fg)[h] + {f[g])[h]. 



(4.22) 
(4.23) 
(4.24) 

A' between D-algebras such 

(4.25) 
(4.26) 
(4.27) 



Definition 4.12. A D-algebra homomorphism is a map T: A 
that T(D(A)) C D(A') and for all g,h e Awe have 

= I 

Hgh) = Hg)Hh) 
Hg([h]) = Hg)[Hh)\. 

4.3.2 Free D-algebras and elementary differentials 



The following definitions are detailed in [44]. Let OT denote the alphabet of all ordered (planar) 
rooted trees: 

ot= { .,:,v1h"£V,Y,..}. 

More generally, we consider decorated ordered rooted trees, where C is a (finite or infinite) set 
of colors. Decorated trees are trees with a color from C assigned to each node. As above, we 
let OT* denote words of trees (forests), let I be the empty word and let Wi,o-> 2 l— * w i w 2 denote 
concatenation for u>i,u>2 € OT*. Identifying C C OT with 1-node trees, we can recursively build 
all words in OT* from C by concatenation and adding roots. For c G C and u> € OT*, define 
B~£(uS) £ OT as the tree with branches u and root c. Often we will be interested in the case where 
C = {•}, just one color. 

As above, let K(OT) denote real polynomials (finite E- linear combinations of words) and 
K((OT)) the dual space of infinite series, such as e.g. 

a = a(I)I + a(»)« + a (1)1 + «(••)•» + a (1)1 + a(V) V + a (•!)•! + a(t»)L + a(«M)*M H 
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On R(OT) we define left grafting (■)[•] : R(OT)xR(OT) -> R(OT) by linearity and for all c G C, 
all reOT and all w, w' G OT* we define: 

w[c] = B+(ou) 
I[u] = u 
t[lulu'] = t[lo]lu + lu(t[lo ]) 
rM W ']] = (™)[J] + (rM)M. 



Compare this with Lemma 4.10 The left grafting t[u>] is obtained by attaching r in all possible 



ways from the left to the vertices of u>, and (tt')[w] is obtained by attaching from left first t' and 
then r on all nodes of lu: 

We henceforth let \ui\ denote the grading counting the total number of nodes, i.e. |c| = 1 for 
all c G C, \u)u'\ = \lu\ + \uj'\ and \lu[lu'}\ = \lu\ + \oj'\. 

Proposition 4.13. Let OT be planar trees decorated with colors C. Consider Af = R(OT) with 
concatenation uj,lo' t— > lulu', left grafting lu,lu' h- > lu[lu'\ and unit I as defined above. N is a free 
D-algebra over C , such that for any D -algebra A and any map v. C — > D(A) there exists a unique 
D-algebra homomorphism map T v : M — > A such that J- V (c) — v(c) for all c G C. 

C c + M 



3! T v 



D{A) 



A 



Definition 4.14. We define the ordered Grossman-Larsor^\ product on J\f for all lu,lu' g OT* as 

LU»LU' = B~{L0[B + {uj')\). 

I.e. we add a root to u>' , graft on lu and finally remove the root again. 
Proposition 4.15. The GL-product is associative and satisfies for all n,n',n" G M 

n[n'[n"}] = (n»n')[n"\ (4.28) 



T„(nun') = T v {n)»T v (n'). 



(4.29) 



Remark 4.16. The classical setting of Cayley, Merson and Butcher is the case where M. = R n 
and q = {d/d xj\ C XM. is the standard commutative coordinate frame. The construction of 
Section 



4.3.1 



produces U(q) m as a D-algebra where the concatenation is commutative. The 
connection is now flat and torsionless, and f[g] becomes a pre-Lie product. The images of the 
trees !F(t), for t g OT, are called the elementary differentials in Butcher's theory (see [S]). These 
are explicitly given in (2.3 1. The images of the forests F{lu), for lu G OT*, are called elementary 
differential operators in Merson's theory (see |35j). 



4.3.3 A generalized Connes Kreimer Hopf algebra of planar trees 

We recall from [U] the definition of the Hopf algebra TLn- On the vector space K(OT) we define 
the shuffle product U, and we define the coproduct as the dual of the ordered GL product, 
i.e. such that 

(a.0)(w)= ]T a{uu (1) )l3{Lu {2) ) for all q,/?£ M((OT))}. (4.30) 

(")a n 

4 The GL product is usually defined in a similar way over non-planar trees. 
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The motivation for this construction is the representation of U (XM.) in terms of a frame g C XM. 
as U(q) m . The shuffle product is the correct product to characterize which series in M((OT)) 
represent vector fields on M. and which represent diffeomorphisms. The composition in U(XA4) 
appears as the product • on U(q) m , thus with the coproduct Ajv the convolution on M((OT)) 
represents composition in U(XA4). 

It remains to give a precise characterization of Ajv and the antipode in TLn- As in the Connes- 
Krcimer case, both Ajv and the antipode can be defined directly in terms of admissible cuts or in 
a recursive fashion. Recursively Apf is given as 

Ajv (I) = 

+ (4-31) 

An (lot) — ujt®1 

where r = B^(u>i) G OT, where uj,l*J\ G OT* and where U- denotes shuffle on the left and 

concatenation on the right, (uii^Ti) U -(lo2®T2) = (u>i U W2)®(tiT2)- The direct formula is given 
as 

A n (lj) = J2 ^V)®^^ (4-32) 

fGFALC(w) 

where FALC denoted Full Admissible Left Cuts, P e (u>) is the shuffle of all the cut off parts, and 
R 1 (uj) is the remaining part containing the root, see |41J . Calculations of the coproduct for forests 
up to order 4 can be found in Table [l] 

Theorem 4.17. Let TLn be the vector space Af = K(OT) with the operations 

product: (iN^adgib) = a U b, 
coproduct: Ajv, 

unit: Ujv(1) = I, 

' 1, tfw=I, 



- , . „ (4-33) 



counit: ejv(w) — 

1 0, else 

Then TLn is a Hopf algebra with an antipode Sn given by the recursion 

S n (lut) = -[i N ((S N ®I) (AjvH U •(/®B+)A / v(wi))) , 
where r = B+{u){) G OT and lu.uji G OT*. 

4.3.4 Lie Butcher series and flows on manifolds 

The set of maps U(g) M from M. to U(q) is a D-algebra where the derivations are the vector 
fields g . Thus, given a set of colors C and a map v: C — > there exists a unique map 
T v : M ^ U (fl) such that for all c e C and all g,h £ J\f we have 

JF„( C ) = i/(c) (4.34) 
^(I)=I (4.35) 
F„{gh) = T v {g)T v {h) (4.36) 
Jv(ffW)=Jv(5)[Jv(/»)] (4.37) 
Fu{g*h)=F v {g)*F v {h). (4.38) 

(4.39) 

Definition 4.18. For an infinite series a G A/"* = R((OT)) a Lie-Butcher series is a formal series 
in f7(g) defined as 

B t {a)= Y, * |w| «(w)^(w). 
weOT* 



21 



Note that AT can be turned into a Hopf algebra two different ways: either as Hsh with product 
U and deconcatenation coproduct A^, or as TIn with the same product U, but where the coproduct 
Ajv is the dual of the ordered GL product. This gives rise to two different convolutions on TV* , 



the frozen composition a, (3 i— > a/3 in Example 3.10 and the non-frozen composition a,/3 i— > a»(3 



as in (4.301. Since the product is the same, we have that the characters and the infinitesimal 



characters are the same as vector spaces 

fl(Wsh) = s(H N ) = { a <E W | a (I) = 0, a(uj Uw')=0 for all lu, u)' € OT* \I } 
G(H S h) = G{H N ) = { a G TV | a(I) = 1, a{u U a/) = a(a;)a(c</) for all w, u/ € OT* }. 

Hovever, the exponential, logarithm, Dynkin and Eulerian idempotents, as well as the antipode 
depend on whether they are based on Hsh or Hn- Which to use in practice depends on which 
operation we want to express on the manifold. Recall that frozen elements of U{q) m are constant 
functions g: M. — > U(g). If g is frozen then f[g] = for all /, and hence f»g = fg. The subalgebra 
of frozen vector fields thus reduces to Hsh- 

We summarize the basic properties of LB-series: Bt sends infinitesimal characters to (formal) 
vector fields on M. and characters to pullback series representing formal diffcomorphisms on M.. 
LB-series preserve both frozen and non-frozen composition and sends left grafting to the connection 
on U( ) M , 

B t (aP) = B t (a)B t ((3) 
B t (a.[3) = B t (a).B t (f3) 

B t {a[P])=B t {a)[B t {(})]. 

Note that if a S G(Hn), then a[(3] represents algebraically the pullback (parallel transport) of /3 
along the flow of a. On the manifold 

B h (a[(3])(yo) = B h (a)[B h (P)](y ) = B h ((3)(<S>(y )) 7 

where 4> is the diffcomorphism represented by a G G{TLn) at t = h. Since the connection is flat, 
the pullback depends only on the endpoint $(2/0) an d not on the actual path. 

There are (at least) three ways to represent a flow y i— > y t = $t(yo) on M., using LB-series: 

1. In terms of pullback series. Find a 6 G(TLn) such that 

V(y(t)) = B t (aM(y ) for any ^eU(g) M . (4.40) 

This representation is used in the analysis of Crouch-Grossman methods by Owren and 
Marthinsen [Hj. In the classical setting, this is called a S-series [IS] . 

2. In terms of an autonomous differential equation. Find (3 £ g(7iAr) such that y(t) solves 

y'(t) = B h ((3)(y(t))-y(t). (4.41) 

In the classical setting, this is called backward error analysis. In the Lie group setting, this 
formulation has, however, never been investigated in detail (but it should!). 

3. In terms of a non-autonomous equation of Lie type (time dependent frozen vector field). 
Find 7 e g(7ish) such that y(t) solves 

y'{t)=^B t { 1 ){y )-y{t). (4.42) 

This representation is used in [3"Tl I3"5] . In the classical setting this is (almost) the standard 
definition of B-series. The connection with the classical B-series is discussed below. 
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The algebraic relationship between a, (3 and 7 is given as follows: 



/3 = aoe e is Euler idempotent in TIn. 

a = exp* (/3) Exponential wrt. GL-product 

7 = uoY~ 1 oD Dynkin idempotent in Hsh(OT). 



a = Q( 7 ) Q-operator (4.13 1 in ft S h(OT) 



Example 4.19. Two examples are of particular interest; the exact solution and exponential Euler 
method. In both cases we consider y'(t) = f(y)-y, where C = {•} and v(») = f. 

The exponential Euler method is particularly simple. Since each step of the method follows 
the flow the frozen vector field f(y„) € 0, the Type 3 LB-series for Exponential Euler must be 
given by 

TEuler = • 

just as in the classical setting] 

Type 3 LB-series for the exact solution can be derived in various ways. Theorem 2.2 in [38] 
derives the exact solution as the solution of 

y' = ft-V, 2/(0) =yo, 

where ft = f(y(i)) € is the pullback of / along the time dependent flow of f t . Letting 
/* = I B tW we obtain 

Fo 7 = Q( 7 )[.] 7 = y- 1 o J B+(Q( 7 )). 
This is known as a combinatorial Dyson-Schwinger equation |24j . Solving by iteration yields 

. + ii + i(V + i) + i(^r + V + 2^ + Y + Li(^ + V + 24^ 



7Exact 



2! 3 p / 4 ,v / 5 j 



• • • 



Remarkably, the LB-series of the exact solution is just a combination of trees, and not commutators 
of trees. Thus in Type 3 LB-series developments of numerical integrators, commutators of trees 
must be zero up to to the order of the method. 

Composition and inverse is simplest for pullback series, Type 1. For series of Type 3, we must 
map to Type 1, compose (or invert) and map back again. If 7,7 are series of Type 3, then the 
basic operations are done as: 

Composition: 7, 7 ^ {Q{i)»Q^f))oY- l oD (4.43) 
Inverse: 7" 1 = Qi^oSoY^oD (4.44) 
Backward error: Log 3 (7) := Q( 7 )oe. (4.45) 



4.3.5 Relations to classical B-series 

The relation between classical B-series and LB-series is detailed in [41"] . Classical B-series are 
expressed in terms of linear combinations of non-planar trees T, resulting in the Connes-Kreimer 
Hopf algebra TLc built from non-planar trees [4] . In the classical setting the connection is torsion- 
free, and concatenation is commutative. Therefore q(Hc) = span(T). That is, q(TLc) is just linear 

5 The classical presentation is 7 = I + •, when the B-series is given in the form (1 2 . 2 b . 
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combinations of trees. This fact is the reason why many discussions in the classical setting can 
avoid series involving forests of trees (words in T* ) . Also the difference between series of Type 1 
and Type 3 is in many papers not emphasized. Since the coefficients k of the Q-polynomials add 
up to one under symmetrization, we find in the classical setting that 

Q(a)(oj) = a{Ti)a{T 2 ) ■ ■ ■ a{r k )u), 

for u) = B + (t\T2 ■ ■ -Tfc), thus formulas involving pullbacks are often expressed directly from B- 
series (Type 3) using the Q-polynomials in this form. Our claim that classical B-series fits best 



into series of Type 3, is based on the trivial observation that the curve y t = B t (a)(y) in (4.421 
solves a differential equation with a time dependent frozen vector field given as 

A question is why the symmetrization cr(r) is natural to include in the classical setting, but 
not in the LB-series setting. To explain the relationship between the two theories we define a 
symmetrization operator: 

Definition 4.20. The symmetrization operator Q : Af — > Af is defined for uo S OT* and r £ OT 

as 

= I, 

Vl{ujt) = n(u) Ufi(r), 
Q(B+(cj)) = B+(Q(u)). 

The shuffle product permutes the trees in a forest in all possible ways, and the symmetrization 
of a tree is a recursive splitting in sums over all permutations of the branches. The symmetrization 
defines an equivalence relation on OT*, that is 

f2(ci>i) = Q,(u>2) < ^=> ~ U>2- 

Let l : Tic — > T~In be an inclusion where a tree is identified with one of its equivalent planar 
trees. In jJTj we show that Ct = Hot ; He — > ~Hn is a Hopf algebra isomorphism onto its image, 
i.e. He is a proper subalgebra oIHn- The adjoint map fl* : H* N — > H* c is given as 



n*(a)(o;) = ct(cj) 



The tree symmetrization a(m) enters exactly such that the LB-series as given in (4.181 maps to 



the classical B-serics in (2.2) 



4.4 Substitution law for LB-series 

The so-called substitution law for B-series |14j can without much difficulty be generalized to LB 
series. 

Consider Af as a D-algebra where the derivations are the Lie polynomials D(AT) = q(Hn) (~)J\f. 
By the universality property of Af, we know that for any map a: C — * D{N) there exists a unique 
D-algebra homomorphism T a : Af — > Af such that .F a (c) = a(c) for all a E C. This is called the 
substitution law. 
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Definition 4.21. For any map a: C — ► D(M) 
there exists a unique D-algebra homomorphism 
a-k : J\f —* J\f such that a(c) = a * c for all c e C. 
The map a* is called a-substitutiorj^J 



CM 



- TV 



N. 



The properties of this substitution law, together with applications of it, will be studied in a 
forthcoming paper. We just mention that many of the useful properties of the substitution law 
follow immediately from the fact that a* : TV — > TV is a homomorphism. For example, for all 
n,n' <E Af we have: 



a*I = l 
a-k(nn) = (<x*n)(a*n) 
a * (n[n ]) = (a -k n) [a * n'\ 
a -k (nun') = (a-k n)u(a * n') 



5 Final remarks and outlook 

Inspired by problems in numerical analysis we have discussed various algebraic structures arising 
in the study of formal diffeomorphisms on manifolds. We have seen that the Connes-Kreimer Hopf 
algebra naturally extends from commutative frames on R™ to non-commutative frames on general 
manifolds. In particular we have presented the Dynkin and Euler operators and non-commutative 
Faa di Bruno type bialgebras in this generalized setting. 

The formalism in this paper has many applications in numerical analysis, and analysis of 
Lie group integrators in particular. However, the underlying structures are general constructions 
with possible applications in other fields, such as geometric control theory and sub-Ricmannian 
geometry. Connections to stochastic differential equations on manifolds is an other topic which is 
worth investigating further. 
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6 In most applications we want to substitute infinite series and extend a* to a homomorphism a*: J\f* — * Af* 
The extension to infinite substitution is straightforward because of the grading, we omit details. 
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51 + U 



•I® 



I 



- «(gii + 1®* + i®i 

+ N§l + «<g)I + I® v 
f 2»»<g* + »(g)I + *(g)M + I(g)«I 

f I®* + «®m + e®L 



^g)I 
V®i 

J®I 

L®i 

••J®! 

•I* 



- i®« + 1® j 

+ V®« + « 

+ *I®« + 2. 

+ Lg>« + I 

ai + m«®« h 
f + !•< 

f l(g)« + I®! 

I + 3«m®« - 

I + Y®« + 

f Lg>« + •! 

+ 3«M®» + 

+ «2(g)« + 2. 

+ L®« + 1< 



+■ »®i 

•(8)1 + »®1 + 1(8 V 

»®I + «®1 + «(8 V + E®^ 

5)1 + «(8 V + 1(8 V 

- m®! + «<8 V + 1(g) 

+ I®m + 2«*gl + «g>«2 + •®I + E®«I 

• + »®L + e®L 

- m®m + 2—(g>l + «®«I + »(8 V + I(8»V 

M®M + *®L + 1(8 V» 

§• + 1(8)1 + 2m®m + «g>«2 + «®L + E®H 
2m®m + m®I + *®*m + «8«2 + I®««I 
M®M + «®«M + »®I« + I(8«I» 
3m + «®«M + I®L« 



.(81 



5» + M®M + «®M* + . 



Table 1: Examples of the coproduct A at, defined in (4.31 1 
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